Partial least squares: a versatile tool for the analysis of high-dimensional genomic data

نویسندگان

  • Anne-Laure Boulesteix
  • Korbinian Strimmer
چکیده

Partial least squares (PLS) is an efficient statistical regression technique that is highly suited for the analysis of genomic and proteomic data. In this article, we review both the theory underlying PLS as well as a host of bioinformatics applications of PLS. In particular, we provide a systematic comparison of the PLS approaches currently employed, and discuss analysis problems as diverse as, e.g. tumor classification from transcriptome data, identification of relevant genes, survival analysis and modeling of gene networks and transcription factor activities.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Spectrophotometric Simultaneous Kinetic Determination of Iodide and Iodate Using Partial Least-Squares Calibration Method in a Single Kinetic Run

A rapid, sensitive and versatile kinetic method is presented for the simultaneous spectrophotometric determination of iodide and iodate by partial least-squares regression (PLS) using original and derivate data named as absorbance and rate data. The method is based on the catalytic effect of the cited anions on the reaction rate between Ce(IV) and As(III) in 2 mol l?1 sulfuric acid medium. The ...

متن کامل

Robust high-dimensional semiparametric regression using optimized differencing method applied to the vitamin B2 production data

Background and purpose: By evolving science, knowledge, and technology, we deal with high-dimensional data in which the number of predictors may considerably exceed the sample size. The main problems with high-dimensional data are the estimation of the coefficients and interpretation. For high-dimension problems, classical methods are not reliable because of a large number of predictor variable...

متن کامل

Methods for regression analysis in high-dimensional data

By evolving science, knowledge and technology, new and precise methods for measuring, collecting and recording information have been innovated, which have resulted in the appearance and development of high-dimensional data. The high-dimensional data set, i.e., a data set in which the number of explanatory variables is much larger than the number of observations, cannot be easily analyzed by ...

متن کامل

An improved structure models to explain retention behavior of atmospheric nanoparticles

The quantitative structure-retention relationship (QSRR) of nanoparticles in roadside atmosphere against the comprehensive two-dimensional gas chromatography which was coupled to high-resolution time-of-flight mass spectrometry was studied. The genetic algorithm (GA) was employed to select the variables that resulted in the best-fitted models. After the variables were selected, the linear multi...

متن کامل

Designing a Commercialization Model for Research Achievements at a Military University Research Institute by Partial Least Squares Structural Equation Modeling

Background and Aim: Today, in universities and research institutes, the lack of attention to commercialization makes it impossible or difficult to enter the markets for technology and research products. therefore, this study aims to design a commercialization model for research achievements of a military research institute. Methods: This descriptive-analytic study was done in a cross-sectional ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • Briefings in bioinformatics

دوره 8 1  شماره 

صفحات  -

تاریخ انتشار 2007